Remove unnecessary CUDA sync of qwen image and video preprocess #22792

cyyever · 2025-08-13T07:01:03Z

Essential Elements of an Effective PR Description Checklist

This PR removes unnecessary CUDA sync in _process_image_input and _process_video_input of vllm/model_executor/models/qwen2_5_vl.py by utilising grid_thw_list.

Before the fix, the _process_image_input function of vllm/model_executor/models/qwen2_5_vl.py could take nearly 1/4 of the OwnTime as measured by py-spy.

After the fix, the bottlenecks have moved to other places, and further fixes will follow. However, this fix is simple enough that I want to submit it first.
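To illustrate the idea in the description, here is a minimal sketch of computing per-item split sizes from a host-side Python list rather than from a tensor that lives on the GPU. It is not the PR's actual diff. The helper name `split_sizes`, the `spatial_merge_size` parameter, and the assumption that each entry of `grid_thw_list` is a `(t, h, w)` triple are all illustrative.

```python
from math import prod


def split_sizes(grid_thw_list, spatial_merge_size):
    """Return the number of merged embedding tokens per item.

    Hypothetical helper: everything is computed from plain Python ints,
    so no device-to-host copy (and hence no CUDA sync) is needed.
    """
    merge_area = spatial_merge_size * spatial_merge_size
    return [prod(thw) // merge_area for thw in grid_thw_list]


# Assumed example input: two items given as (t, h, w) triples.
grid_thw_list = [[1, 4, 6], [2, 8, 8]]
sizes = split_sizes(grid_thw_list, spatial_merge_size=2)
print(sizes)  # [6, 32]
```

A list of ints like this can be passed directly to `torch.Tensor.split`, whereas calling `.tolist()` on a CUDA tensor to obtain the same sizes forces the host to wait for the device.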

gemini-code-assist

Code Review

This pull request aims to remove unnecessary CUDA synchronizations in qwen2_5_vl.py for performance improvement. The changes in _process_image_input and _process_video_input correctly identify a source of synchronization but introduce a correctness bug when handling empty inputs. My review provides critical feedback with suggestions to fix this bug while still achieving the desired performance gain.

vllm/model_executor/models/qwen2_5_vl.py

github-actions · 2025-08-13T07:03:21Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

🚀

DarkLight1337

Thanks, can you provide some profiling or benchmark results to show the effectiveness of this optimization?

cyyever · 2025-08-13T09:19:46Z

@DarkLight1337 Added.

Signed-off-by: cyy <cyyever@outlook.com>

vllm/model_executor/models/qwen2_5_vl.py

Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com> Signed-off-by: Yuanyuan Chen <cyyever@outlook.com>

Signed-off-by: cyy <cyyever@outlook.com>

DarkLight1337

Thanks for the quick response, LGTM!

…-project#22792) Signed-off-by: cyy <cyyever@outlook.com> Signed-off-by: Yuanyuan Chen <cyyever@outlook.com> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com> Signed-off-by: Diego-Castan <diego.castan@ibm.com>

…-project#22792) Signed-off-by: cyy <cyyever@outlook.com> Signed-off-by: Yuanyuan Chen <cyyever@outlook.com> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>

…-project#22792) Signed-off-by: cyy <cyyever@outlook.com> Signed-off-by: Yuanyuan Chen <cyyever@outlook.com> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com> Signed-off-by: Xiao Yu <xiao.yu@amd.com>

cyyever requested a review from sighingnow as a code owner August 13, 2025 07:01

mergify bot added the qwen (Related to Qwen models) label Aug 13, 2025

gemini-code-assist bot reviewed Aug 13, 2025

DarkLight1337 reviewed Aug 13, 2025

Fix CUDA sync of qwen image and video preprocess

ee26070

Signed-off-by: cyy <cyyever@outlook.com>

cyyever force-pushed the qwen_fix branch from 33ab507 to ee26070 Compare August 13, 2025 09:36

DarkLight1337 reviewed Aug 13, 2025

cyyever and others added 2 commits August 13, 2025 18:14

Update vllm/model_executor/models/qwen2_5_vl.py

24215cf

Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com> Signed-off-by: Yuanyuan Chen <cyyever@outlook.com>

Apply suggestion

19d7685

Signed-off-by: cyy <cyyever@outlook.com>

DarkLight1337 approved these changes Aug 13, 2025

DarkLight1337 enabled auto-merge (squash) August 13, 2025 10:21

github-actions bot added the ready (ONLY add when PR is ready to merge/full CI is needed) label Aug 13, 2025

vllm-bot merged commit 6772bb0 into vllm-project:main Aug 13, 2025
41 of 47 checks passed

cyyever deleted the qwen_fix branch August 15, 2025 13:03

ywang96 mentioned this pull request Aug 29, 2025

[MM Encoder] General encoder performance improvement #23884 (Open, 1 task)

This was referenced Sep 5, 2025

[Model] Remove unnecessary CUDA sync of GLM-4.1V image and video preprocess #24332 (Merged)

[Model] Remove unnecessary CUDA sync of Qwen2VL image and video preprocess #24334 (Merged)
